Cream of the Crop: Distilling Prioritized Paths For One-Shot Neural Architecture Search

Neural Information Processing Systems

One-shot weight sharing methods have recently drawn great attention in neural architecture search due to high efficiency and competitive performance. However, weight sharing across models has an inherent deficiency, i.e., insufficient training of subnetworks in the hypernetwork. To alleviate this problem, we present a simple yet effective architecture distillation method. The central idea is that subnetworks can learn collaboratively and teach each other throughout the training process, aiming to boost the convergence of individual models. We introduce the concept of prioritized path, which refers to the architecture candidates exhibiting superior performance during training.
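The abstract's "prioritized path" idea can be sketched as a small board that tracks the top-k subnetworks (paths) by validation performance during hypernetwork training, with the best one serving as a distillation teacher. This is a minimal illustrative sketch; the class and method names are hypothetical, not the authors' implementation.

```python
import heapq

class PrioritizedPathBoard:
    """Hypothetical sketch of a 'prioritized path' board: keep the top-k
    architecture candidates seen so far, ranked by validation accuracy."""

    def __init__(self, capacity=10):
        self.capacity = capacity
        self._heap = []   # min-heap of (accuracy, counter, path); weakest at root
        self._count = 0   # insertion counter, breaks ties so paths never compare

    def update(self, path, accuracy):
        entry = (accuracy, self._count, path)
        self._count += 1
        if len(self._heap) < self.capacity:
            heapq.heappush(self._heap, entry)
        elif accuracy > self._heap[0][0]:
            # New candidate beats the weakest stored path: evict and replace
            heapq.heapreplace(self._heap, entry)

    def best(self):
        """Best-performing path so far, e.g. to use as a distillation teacher."""
        return max(self._heap)[2]

# Usage: feed in candidate paths sampled during supernet training
board = PrioritizedPathBoard(capacity=3)
for i, acc in enumerate([0.61, 0.72, 0.58, 0.79, 0.65]):
    board.update(f"path-{i}", acc)
print(board.best())  # → path-3 (the 0.79-accuracy candidate)
```

The fixed-capacity min-heap keeps updates O(log k) per sampled path, which matters when candidates are evaluated at every training step.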



Review for NeurIPS paper: Cream of the Crop: Distilling Prioritized Paths For One-Shot Neural Architecture Search

Neural Information Processing Systems

Weaknesses: The search space is not the same as in the Google publications but is similar to Once-for-All. The SE ratio is 0.25 in this paper's code, the expansion rates are {4, 6}, and the maximum depth is 5 in every stage, which is slightly different. Thus, please report #params in Tab. 1. L120. In this paper, the authors use 2K images as the validation set (L212) and use the validation loss to train the meta-network M. The authors claim that this step is time-consuming (L159); how many iterations in total are used for updating M in this paper? The Kendall rank correlation is important, and I would prefer more results.
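The reviewer's point about search-space differences can be made concrete with a small sketch: even modest changes in the per-stage choices change the candidate pool, which is why comparable #params reporting matters. The values below are the ones quoted in the review; the field names and counting function are illustrative assumptions, not from the paper's code.

```python
# Hypothetical encoding of the search-space settings the review quotes
# from this paper's code; names are illustrative.
this_paper_space = {
    "se_ratio": 0.25,           # squeeze-and-excitation ratio
    "expansion_rates": [4, 6],  # MBConv expansion choices per block
    "max_depth_per_stage": 5,   # up to 5 blocks per stage
}

def num_configs_per_stage(space):
    """Count candidate configurations in one stage: the stage picks a depth
    d in [1, max_depth], and each of the d blocks picks an expansion rate."""
    k = len(space["expansion_rates"])
    return sum(k ** d for d in range(1, space["max_depth_per_stage"] + 1))

print(num_configs_per_stage(this_paper_space))  # → 62 (2 + 4 + 8 + 16 + 32)
```

Adding one expansion choice or one depth level multiplies this count, so two "similar" spaces can differ substantially in the architectures (and parameter budgets) they can express.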
